Discovery of Discourse-Related Language Contrasts through Alignment Discrepancies in English-German Translation
Authors
Abstract
In this paper, we analyse alignment discrepancies for discourse structures in English-German parallel data, that is, sentence pairs in which discourse structures in the source or target text have no alignment in the corresponding parallel sentence. The discourse-related structures are formulated as linguistic patterns based on information from automatic part-of-speech and dependency annotation. Apart from alignment errors (existing structures left unaligned), these discrepancies can be caused by language contrasts or by explicitation and implicitation in the translation process. We propose a new approach, including a new type of resource, for corpus-based language contrast analysis and apply it to study and classify the contrasts found in our English-German parallel corpus. Since unaligned discourse structures may also lead to a loss of discourse information in MT training data, we hope to deliver information in support of discourse-aware machine translation (MT).
Similar resources
Seeking Source Discourse Ideology by English and Persian Translators: A Comparative Think Aloud Protocol Study
Discourse audiences are susceptible to falling victim to the concealed ideological representations in discourses, at the expense of changes to the mental models through which they act on the world. Translators, as readers and at the same time intercultural mediators, need to be equipped with the knowledge of how ideology is accommodated in discourse both not to fall victim to it and to...
Investigating Discourse Socialisation Progress of an English as a Second Language Learner Using Systematic Functional Linguistic Approach
This study was framed on the theory of Language Socialisation and a Systematic Functional Linguistic (SFL) approach. The aim of the study was to analyse the oral presentation discourse produced by an elementary Iranian English as a Second Language (ESL) postgraduate student in an American university four times (September/December, 2015 and March/September, 2016) over one year. The data were col...
Discovery of Ambiguous and Unambiguous Discourse Connectives via Annotation Projection
We present work on tagging German discourse connectives using English training data and a German-English parallel corpus, and report first results towards a more comprehensive approach of doing annotation projection for explicit discourse relations. Our results show that (i) an approach based on a dictionary of connectives currently has advantages over a simpler approach that uses word alignmen...
Lexical Translation in Movies: A Comparative Analysis of Persian Dubs and Subtitles through CDA
The impact of translated movies has already been emphasized by quite a number of researchers. The present study aimed to investigate the strategies of the Iranian subtitlers and dubbers of English movies in rendering English words. To this aim, three theoretical frameworks were employed: The researchers first used the strategies proposed by Venuti (2012) and Newmark's (1991) classification and ...
Transmission of Ideology through Translation: A Critical Discourse Analysis of Chomsky’s “Media Control” and its Persian Translations
Among the factors that might influence translators’ minds while producing a text is the notion of ideology transmission through text or talk. Adopting Critical Discourse Analysis (CDA) with particular emphasis on the framework of Van Dijk (1999), the present investigation is an attempt to shed light on the relationship between language and ideology involved in translation in general, and more speci...